# Efficient Pretraining
## GenMedClip
License: MIT
GenMedClip is a zero-shot image classification model based on the open_clip library, specializing in medical image analysis (see the usage sketch below).
Category: Image Classification
wisdomik · 40 · 0

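A minimal zero-shot classification sketch, assuming the checkpoint is published on the Hugging Face Hub under a path such as `wisdomik/GenMedClip` and loads through `open_clip`'s hub interface; the repo id, image file, and label prompts below are placeholders, not confirmed details of the model.

```python
import torch
import open_clip
from PIL import Image

# Hypothetical hub path; substitute the actual GenMedClip repo id.
model_id = "hf-hub:wisdomik/GenMedClip"
model, preprocess = open_clip.create_model_from_pretrained(model_id)
tokenizer = open_clip.get_tokenizer(model_id)
model.eval()

labels = ["chest X-ray", "brain MRI", "abdominal CT"]  # example label set
image = preprocess(Image.open("scan.png")).unsqueeze(0)  # placeholder image file
text = tokenizer([f"an image of a {label}" for label in labels])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(dict(zip(labels, probs[0].tolist())))
```
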
## Rho-Math-1b v0.1
License: MIT
Rho-1 is a language model specialized in mathematics, pretrained with the Selective Language Modeling (SLM) method, which significantly improves accuracy on mathematical problem solving (see the sketch below).
Category: Large Language Model
Tags: Transformers, English
microsoft · 1,451 · 15

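A short generation sketch, assuming the checkpoint is hosted on the Hugging Face Hub as `microsoft/rho-math-1b-v0.1` (the exact repo id is an assumption) and loads through the standard `transformers` causal-LM classes; the prompt format is illustrative only.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/rho-math-1b-v0.1"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Greedy decoding of a simple math question.
prompt = "Question: What is 17 * 24?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
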
## TinyLlama v1.1 Math Code
License: Apache-2.0
TinyLlama is a compact language model with 1.1 billion parameters that adopts the same architecture and tokenizer as Llama 2; this v1.1 variant targets math and code, and the model suits applications with limited compute and memory.
Category: Large Language Model
Tags: Transformers, English
TinyLlama · 3,436 · 11

## TinyLlama 1.1B Intermediate Step 1431k 3T
License: Apache-2.0
TinyLlama is a 1.1B-parameter Llama model pretrained on 3 trillion tokens, designed to provide compact and efficient text generation capabilities (see the sketch below).
Category: Large Language Model
Tags: Transformers, English
TinyLlama · 25.04k · 173

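A minimal text-generation sketch using the `transformers` pipeline API; the repo id `TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T` is assumed from the entry name, and the prompt is a placeholder. The same pattern applies to the other TinyLlama checkpoints in this list.

```python
from transformers import pipeline

# Assumed repo id; swap in any of the TinyLlama checkpoints listed here.
generator = pipeline(
    "text-generation",
    model="TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T",
)

out = generator("Efficient pretraining matters because", max_new_tokens=40, do_sample=False)
print(out[0]["generated_text"])
```
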
## TinyLlama 1.1B Intermediate Step 1195k Token 2.5T
License: Apache-2.0
TinyLlama is a compact 1.1B-parameter Llama model designed for resource-constrained environments; this intermediate checkpoint covers roughly 2.5 trillion of the planned 3 trillion pretraining tokens.
Category: Large Language Model
Tags: Transformers, English
TinyLlama · 419 · 52

## Sheared-LLaMA-2.7B
License: Apache-2.0
Sheared-LLaMA-2.7B is a lightweight language model derived from Llama-2-7b through structured pruning and continued pretraining, consuming only a 50B-token budget.
Category: Large Language Model
Tags: Transformers
princeton-nlp · 1,131 · 60

## TinyLlama 1.1B Step 50K 105b
License: Apache-2.0
TinyLlama is a 1.1B-parameter Llama model planned to be pretrained on 3 trillion tokens, with training optimized to complete in 90 days on 16 A100-40G GPUs; this checkpoint corresponds to step 50K, roughly 105B tokens.
Category: Large Language Model
Tags: Transformers, English
TinyLlama · 14.41k · 133

## CodeT5+ 16B
License: BSD-3-Clause
CodeT5+ is an open-source family of large language models for code with an encoder-decoder architecture that supports multiple operating modes; this 16B model suits a wide range of code understanding and generation tasks (see the sketch below).
Category: Large Language Model
Tags: Transformers
Salesforce · 292 · 65

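A code-completion sketch following the usual CodeT5+ seq2seq loading pattern as I understand it (custom remote code, decoder seeded with the prompt); the repo id `Salesforce/codet5p-16b` and the prompt are assumptions, and the 16B checkpoint realistically requires a large GPU.

```python
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

checkpoint = "Salesforce/codet5p-16b"  # assumed repo id
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(
    checkpoint,
    torch_dtype=torch.float16 if device == "cuda" else torch.float32,
    trust_remote_code=True,  # the checkpoint ships custom encoder-decoder code
).to(device)

enc = tokenizer("def fibonacci(n):", return_tensors="pt").to(device)
enc["decoder_input_ids"] = enc["input_ids"].clone()  # seed the decoder with the prompt
out = model.generate(**enc, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```
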
## VideoMAE Small Finetuned Kinetics
VideoMAE is a masked-autoencoder model for video, pretrained with self-supervision and fine-tuned on the Kinetics-400 dataset, suitable for video classification tasks (see the sketch below).
Category: Video Processing
Tags: Transformers
MCG-NJU · 2,152 · 1

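A video-classification sketch using the VideoMAE classes in `transformers`; the repo id `MCG-NJU/videomae-small-finetuned-kinetics` is assumed from the entry name, and the random frames stand in for a real decoded clip.

```python
import numpy as np
import torch
from transformers import VideoMAEForVideoClassification, VideoMAEImageProcessor

model_id = "MCG-NJU/videomae-small-finetuned-kinetics"  # assumed repo id
processor = VideoMAEImageProcessor.from_pretrained(model_id)
model = VideoMAEForVideoClassification.from_pretrained(model_id)

# Dummy clip: 16 RGB frames of 224x224; replace with frames decoded from a real video.
video = list(np.random.randint(0, 255, (16, 224, 224, 3), dtype=np.uint8))

inputs = processor(video, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(-1).item()])
```
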
## VideoMAE Huge Finetuned Kinetics
VideoMAE is a video model based on the Masked Autoencoder (MAE) approach, pretrained with self-supervised learning and fine-tuned on the Kinetics-400 dataset, suitable for video classification tasks.
Category: Video Processing
Tags: Transformers
MCG-NJU · 2,984 · 4

## IndoBERTweet Base Uncased
License: Apache-2.0
IndoBERTweet is the first pretrained language model specifically for Indonesian Twitter, built by extending Indonesian BERT with domain-specific vocabulary.
Category: Large Language Model
Tags: Transformers, Other
indolem · 2,848 · 12

## ArabicTransformer Base
An efficient Arabic language model based on the Funnel Transformer architecture and the ELECTRA pretraining objective, offering low computational cost with strong downstream performance.
Category: Large Language Model
Tags: Transformers
sultan · 17 · 1

## BERTIN RoBERTa Base Spanish
BERTIN is a series of Spanish BERT-based models. The current model is a RoBERTa-base model trained from scratch on a portion of the Spanish mC4 dataset using Flax.
Category: Large Language Model
Tags: Spanish
bertin-project · 1,845 · 36

## BERT Base Uncased Sparse 90% Unstructured PruneOFA
License: Apache-2.0
This is a sparsely pretrained BERT-Base model that reaches 90% unstructured weight sparsity through one-shot Prune Once for All (Prune OFA) pruning, intended for fine-tuning on a variety of downstream language tasks (see the sketch below).
Category: Large Language Model
Tags: Transformers, English
Intel · 178 · 0

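A sketch of loading the sparse checkpoint for downstream fine-tuning and checking that the encoder's linear weights are indeed roughly 90% zero; the repo id `Intel/bert-base-uncased-sparse-90-unstructured-pruneofa` and the two-label task are assumptions, and an actual fine-tuning loop would follow the usual `transformers` recipe on top of this.

```python
import torch.nn as nn
from transformers import AutoModelForSequenceClassification

model_id = "Intel/bert-base-uncased-sparse-90-unstructured-pruneofa"  # assumed repo id
# A fresh classification head is added on top of the sparse encoder (2 labels assumed).
model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=2)

# Measure the fraction of exactly-zero weights across the encoder's linear layers.
zeros, total = 0, 0
for module in model.bert.encoder.modules():
    if isinstance(module, nn.Linear):
        zeros += (module.weight == 0).sum().item()
        total += module.weight.numel()
print(f"Encoder linear-weight sparsity: {zeros / total:.1%}")
```
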